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The present pandemic has tremendously raised the health systems’ burden 
around the globe. It is important to understand the transmission dynamics of 
the infection and impose localized strategies across different geographies to 
curtail the spread of the infection. The present study was designed to assess 
the transmission dynamics and the health systems’ burden of severe acute 
respiratory syndrome coronavirus 2 (SARS-CoV-2) using an agent-based 
modeling (ABM) approach. The study used a synthetic population with 
31,738,240 agents representing 90.67 percent of the overall population of 
Telangana, India. The effects of imposing and lifting lockdowns, non- 
pharmaceutical interventions, and the role of immunity were analyzed. The 
distribution of people in different health states was measured separately for 


SARS-CoV-2 ; each district of Telangana. The spread dramatically increased and reached a 
Non-pharmaceutical peak soon after the lockdowns were relaxed. It was evident that is the 
interventions protection offered is higher when a higher proportion of the population is 
India exposed to the interventions. ABMs help to analyze grassroots details 
compared to compartmental models. Risk estimates provide insights on the 
proportion of the population protected by the adoption of one or more of the 

control measures, which is of practical significance for policymaking. 
This is an open access article under the CC BY-SA license. 
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1. INTRODUCTION 

The first case of coronavirus disease (COVID-19) in India was reported on January 30, 2020, post 
which “public health emergency of international concern” was declared by the World Health Organization 
(WHO) considering the impact it could create globally [1]. The epidemic has spread across 221 countries 
with 228,946,779 reported cases and 4,700,214 mortalities globally as of September 19, 2021 [2]. In India, 
total infections reported are 1,73,06,420 with 28,07,388 active cases, 1,42,96,703 recoveries and 
1,95,118 deaths, till April 25, 2021 [3]. The influx of infections has bothered the countries with a denser 
population [1]. Several factors such as gender, pollution level, viral load, comorbidities, and others also 
govern the intensity and duration of infection [4]. A larger proportion of the infected people remaining 
asymptomatic has further raised serious concerns as they are indistinguishable and act as the potential 
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sources of transmitting the infection [4], [5]. The healthcare fraternity, researchers, and policymakers from 
several domains have been trying hard to curtail the transmission of the infection completely [6]. Simulation 
studies in the past have been successful in addressing issues like preparing evacuation plans for airborne 
infections [7], devising vaccination strategies for influenza [8], smallpox [9], containing measles [10], and 
tuberculosis [11]. Agent-based models (ABM) have supported various application areas such as dynamics of 
transmission [12], tracking contacts [13], scheduling time and geography dependent contacts [12], [14], 
planning non-pharmaceutical interventions (NPI) [14], [15] such as enforcing lockdowns [12], [13], using 
face masks, and adopting social distancing [14], and shielding the susceptible population [14], [16]. 

In India, most researches have employed a compartmental approach based on the susceptible (S), 
infective (I), and recovered (R) model with modifications such as susceptible (S), exposed (E), infective (I), 
and recovered (R) (SEIR) [17], susceptible (S), hospitalized or quarantined (H), symptomatic (I), purely 
asymptomatic (P), exposed (E), recovered (R) and deceased (D) (SIPHERD) [18], and analytical models 
[19]. Simulations are used to study the real-life systems’ behavior in its existing state and upon implementing 
modifications without associated risks and investment of time and cost [20], [21]. However, the accuracy is 
subject to authenticity of data, constraints, and assumptions [21]. Complex and dynamic problems could be 
effectively addressed using simulations [22]. ABM, discrete event simulation (DES), and system dynamics 
(SD) are three broad classifications of simulations. ABMs entitles the users to define agent-level details [22], 
[23] and are capable of reporting details of individual agents while DES and SD provide only collective 
measures [24]. Each agent in the population can be simulated based on different conditions and can be made 
to perform different actions. A bottom-up approach is employed by ABM wherein the behavior of each agent 
contributes to the behavior of the system. Each agent holds a specified state at any instant of the simulation 
[23]. Technological advancements have improved the capability of systems to handle complex models [22]. 


2. RESEARCH METHOD 

From the literature, it was evident that most of the studies to assess the transmission dynamics of 
infectious diseases employ compartmental models, which fail to incorporate agent-level details. Hence, the 
present study aims to provide an ABM-based simulation to estimate the spread of COVID-19 by developing 
a disease model and simulating it using Python. Such simulation studies based on synthetic populations could 
be helpful for the policymakers and healthcare systems to equip themselves based on the estimates. The 
present study simulates agents of the synthetic population that represent 90.67% (n=35,003,674) of the 
overall population of a state. The parameters such as age and geographic information system (GIS) 
coordinates have been mapped to each individual in the population to ensure exact representation of the state. 
The incorporation of such agent-level details would help in effectively devising policies locally. Analyzing 
the NPIs and risk estimates have practical significance in terms of policymaking and governance. In 
accordance with the Swiss cheese model, the combined effect of multiple interventions on curtailing the 
spread of infection has been analyzed. Hospitals have been benefited from this approach of setting up 
multiple defense strategies [25], [26]. 


2.1. Research design 

An ABM approach is employed to assess the outbreak of COVID-19 and its burden on health 
systems using Telangana state’s synthetic population as shown in Table 1. The code for simulation was 
developed in python, an object-oriented programming (OOP) language using PyCharm, an integrated 
development environment. The model was simulated for 365 days for various lockdown strictness as per the 
Indian scenario [27], [28]. The main functionalities of the code involve creating agents, defining contact 
networks, developing a disease model, devising interventions, and simulating. Transparency of code, 
assumptions, variables, and scope of the study are retained throughout in adherence with the ethical good 
practices in modeling and the International Society for Pharmacoeconomics and Outcomes Research 
(ISPOR-SMDM) modeling good research practices [29]-[31]. 


2.2. Agent creation 

The main idea to employ an ABM was to represent the population of a state by defining the actual 
attributes to each of them. Data of 31,738,270 people were taken from the 2011 census of India to generate 
the synthetic population of Telangana. Unique identifiers for person and household, district codes, and 
geocoordinates were mapped to the agents. During the data cleansing process, 30 invalid entries were 
eliminated to obtain 35,003,674 valid records that represent 90.67% of the state’s population as shown in 
Table 1 [32], [33]. 


An agent-based model to assess corona virus disease 19 spread and ... (Madhavarao Seshadri Narassima) 


4120 O ISSN: 2088-8708 


Table 1. Model parameters 


Parameters <5 5-59 >59 References 
Number of contacts per day Supplementary [28] [34], [35] 

Probability of getting infected through contact (%) i) closer circle: (3 to 10); ii) other contacts: (1 to 5) 34] 
Proportion of people remaining asymptomatic 0.8 [36], [37] 
Average incubation period (in days) 5 [23], [38] 

Average treatment duration (in days) 14 38] 
Proportion of hospitalized cases in ICU 0.11 [39], [40] 
Treatment duration in ICU (in days) Triangular (7, 8, 9) [38], [40] 

Proportion of people moving from ICU to critical illness (Ventilator) 0.88 40 

Treatment duration in ventilator state (in days) Triangular (5, 7, 12) 38 

Time between symptom arrival and admission (in days) 3 41] 

Proportion of people who die As per Indian statistics [3] 

Risk difference for use of control measures (percentage) i) Mask: 10.2; ii) Distancing: 14.3 42] 


2.3. Contact network 

The spread of infection is majorly governed by transmission rates and contact networks. The rate of 
transmission was varied from 3 to 10% and 1 to 5% for external contacts and household contacts respectively 
[34]. Contact rates for the present study were assumed to be density-dependent to be varied rationally across 
districts [15]. The probability with which any two agents meet was assumed to be inversely proportional to 
the distance between them. Kumar et al. [34] conducted a study in Ballabgarh, India to define contact rates 
for close-contact infections. This was used in integration with the density-dependent contact rate assumption 
to determine the contact distributions for all the districts. A multiplication factor (ratio of population density 
of the district under consideration to that of Ballabgarh) was used to find the proportionate corresponding 
contact rates for each district [15], [34]. Input analyzer tool of arena that helps to fit datasets into various 


distributions with corresponding errors was utilized to derive the distributions for contact rates of each 
district [43]. 


2.4. Disease model 

Disease models depict the progression of any disease through various states that govern the behavior 
of agents. Each agent exists in one of the states at any point in time which changes based on the conditions 
presented in the statechart as shown in Figure 1. State chart indicates the state of existence of agents in the 
simulation. 
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Figure 1. Statechart 
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At any instant, each agent can exist in one of the states mentioned in the statechart. These are 
governed by the actions that are defined for each agent during the simulation. The interaction of agents drives 
the transmission of infection from an infected to a healthy agent. Post transmission of infection, agents turn 
to be either asymptomatic or symptomatic. The latter undergo treatment after the incubation period while the 
former are untraceable and are not admitted for treatment. However, they spread the infection till recovery. 
Symptomatic agents further traverse along three states admitted, intensive care unit (ICU) and ventilator 
during which they either recover or decease. The conditions that govern the progression across states with the 


period of existence in each state are shown in Table 1. 


2.5. Model initialization 

Various variables that were used to develop the model were obtained from secondary sources 
including those from the models of infectious disease agent study (MIDAS) [44]. Spatial resolution cannot be 
attributed as the paths of agents are not considered rather the geo-coordinates are only taken into account. A 
time step of one day was considered for the simulation. 


2.6. Model simulation 

The Python code was simulated for the six scenarios defined in Table 2 to compare the effects of 
various NPIs [45]. The variations in strictness of lockdowns were taken care of by altering the contacts of 
agents at different locations such as home, schools, and work, depending on the place and age. The contacts 
outside the home were reduced in accordance with the stringency of lockdowns. 


Table 2. Risk estimations 


Duration . Number or people Relative Risk (95% Attributable Risk (AR) PAR 
(days) Scenario infected CD (95% CD PAR % 
ays Unexposed Exposed i ý 
0 to 104 MD100190 2554 69371 NA NA 
MD75190 21310 31228 0.48 (0.471, 0.506) -0.001 (-0.0014, -0.00137) -0.001 - 
60.41 
MDS50190 20348 14235 0.70 (0.678, 0.721) -0.0004(-0.00042, -0.0002 - 
-0.00038) 18.36 
MD1001180 2100 33437 NA NA 
MD751180 32016 55428 0.58 (0.563, 0.591) -0.0017(-0.00174, -0.0013 - 
-0.00166) 47.18 
MD501180 19685 12586 0.64 (0.617, 0.662) -0.0004(-0.00042, -0.0002 - 
-0.00038) 19.67 
105 t0204 =MD100190 0 6547839 NA NA 
MD75190 1938398 4330703 0.74 (0.743, 0.746)  -0.0624(-0.0627, -0.0621) -0.0468 - 
23.69 
MD50190 3266488 2541571 0.78 (0.777, 0.78) -0.0457 (-0.046, -0.0454) -0.0228 - 
12.46 
MD1001180 0 5668435 NA NA 
MD751180 1693952 3907223 0.77 (0.767, 0.77) -0.0493 (-0.0496, -0.049) -0.037 - 
20.96 
MD501180 3382489 2782060 0.82 (0.821, 0.824) -0.0378 (-0.0385, -0.0375) -0.0189 -9.74 
205 to 304 MD100190 0 5999937 NA NA 
MD75190 1841228 4716186 0.85 (0.852, 0.855) -0.034 (-0.0342, -0.0336) -0.0254 - 
12.29 
MD50190 3332683 2445789 0.73 (0.732, 0.735) -0.056 (-0.0562, -0.0556) -0.0279 - 
15.32 
MD1001180 0 1171268 NA NA 
MD751180 427366 1008006 0.79 (0.783, 0.79) -0.011 (-0.0117, -0.0113) -0.0086 - 
19.01 
MD501180 544102 297366 0.54 (0.542, 0.551) -0.015 (-0.0156, -0.0154) -0.0078 - 
29.42 


3. RESULTS AND DISCUSSION 

The effects of various NPIs such as lockdowns with varied stringency, adoption of social distancing, 
and use of face masks along with the impact of immunity on the spread of infection was observed over 365 
days by simulation of six scenarios. The six scenarios would be referred to as MD100190, MD75190, 
MDS50190, MD1001180, MD75I180, and MDS50I180 in subsequent sections. The numbers after ‘MD’ and ‘I’ 
indicate the proportion of the population exposed to control measures and immunity period (days) 
post-recovery respectively. Time-series graphs representing the number of people in asymptomatic and 
symptomatic states from the overall population are presented in Figure 2. The supplementary file contains 
time series data of individual districts. 
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Figure 2 reveals that the spread of infection has risen sharply after the lifting of lockdowns. The 
reason for subsequent spikes in Figure 2(a) as compared to Figure 2(b) and also in Figure 2(c) as compared to 
Figure 2(d) is due to the loss of immunity in Figures 2(a) and 2(c) after 90 days of recovery. Lockdowns 
prove to be the most effective control measure as the variation in terms of infections reduced relatively lesser 
even though a higher proportion of the population adopt social distancing and mask. However, there is a 
decline in the peaks of graphs in scenarios with a larger population exposed to interventions. 
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Figure 2. Time trend of infections (a) number of asymptomatic people (90 days immunity), (b) number of 
asymptomatic people (180 days immunity), (c) number of symptomatic people (90 days immunity), 
and (d) number of symptomatic people (180 days immunity) 


The possibilities of subsequent peaks are higher in the absence of vaccination, as observed in 
Figure 3. These graphs could improve the preparedness of the healthcare systems and shed light on the 
capacity required to treat the admitted infections as shown in Figures 3(a) and 3(b), make arrangements for 
ICU as shown in Figures 3(c) and 3(d), and ventilators as shown in Figures 3(e) and 3(f). The secondary 
infections are observed as people start to lose their immunity over time. There is a spike in the number of 
deceased people post-lockdown with subsequent peaks in accordance with the trend of infections. Prolonged 
immunity provides an additional time window for planning the capacity and vaccination policies. Figure 4 
shows a similar pattern with some temporal offset governed by the duration of existence in the preceding 
states. The second spike in cumulative infections is earlier in case of Figure 4(a) as compared to that of 
Figure 4(b) due to shorter immunity. Proportionate spikes are seen in Figures 4(c) and (d) indicating recovery 
post infections. Despite the high recovery rate, untraceable asymptomatic people pose a major challenge for 
curtailing the spread as they are highly untraceable to be isolated. 

The risk estimates as shown in Table 2 reveal the level of protection offered through various 
interventions. The interpretation of these estimates are: i) relative risk (RR) is the probability of an event 
occurring to exposed vs unexposed groups; ii) attributable risk (AR) indicates the excess risk due to a risk 
factor. A negative value indicates protection offered; iii) population attributable risk (PAR) indicates the 
percentage of cases in the total population that can be attributed to the risk factor; and iv) PAR% is the 
proportion of the incidence of disease in the population due to exposure. The first period of 104 days 
indicates the period after which the first recovered person would lose immunity. Successively, these 
parameters are calculated for further time intervals to analyze how they vary for different lockdown and 
intervention scenarios. The RR being lesser than 1 denotes that the exposure offers protection rather than risk 
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[46]. Lifting of lockdowns is the reason for the increase in RR over time indicating the reduction in the 
protection offered. Owing to the higher protection imparted by the exposure, MD75190 has a lower RR and 
PAR values than MDS50I90 in the first two timeframes as a higher proportion of the population are exposed in 
the former scenario. The fact that higher protection is offered during stricter lockdowns is evident from and 
PAR%. The lifting of lockdowns on the 143" day as shown in Table 3 has accelerated the transmission 
which has caused the values to peak drastically. The scenarios corresponding to 100 percent exposure i.e., the 
entire population follows control measures, have the least peak values. 

The research by the center for disease dynamics, economics and policy (CDDEP) and Princeton 
University complements the present study as it provides information on the estimated state-wise surge in 
India to help the healthcare fraternity to equip themselves [47]. Considering some other parameters such as 
clustering in contact networks, especially in the context of the spread of infections would provide improved 
results. The inclusion of movement patterns along with GIS would enhance the accuracy of estimates. Using 
wearable devices would offer real-time tracking of COVID-19 patients [48]. The structure of communication 
networks could be studied deeper to establish contact networks [49], [50]. Detailed analyses on dynamics of 
population and contact patterns have a strong scope to understand the spread of infections better. 
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Figure 3. Time trend of hospitalized cases (a) number of admitted people (90 days immunity), (b) number of 
admitted people (180 days immunity), (c) number of people in ICU (90 days immunity), (d) number of 
people in ICU (180 days immunity), (e) number of people on ventilators (90 days immunity), (f) number of 
people on ventilators (180 days immunity) 
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Figure 4. Total infections and recoveries (a) number of cumulative infections (90 days immunity), (b) 
number of cumulative infections (180 days immunity), (c) number of recoveries (90 days immunity), 
and (d) number of recoveries (180 days immunity) 


Table 3. Peak values 


Peak values/ MD100190 MD75190 MD50190 MD1001180 MD751180 MD501180 

Scenarios Value Day Value Day Value Day Value Day Value Day Value Day 
Asymptomatic 2491332 160 2950047 157 2842052 158 2416859 161 2440704 154 3341037 161 
Symptomatic 416618 152 498176 152 513427 152 377067 155 405197 152 532776 154 
Admitted 538683 166 639556 166 634877 166 511264 168 516807 164 715501 167 
ICU 37256 176 44739 175 46224 176 34393 179 37176 175 47353 178 
Ventilator 29641 184 35580 183 36589 184 27389 187 29574 183 37891 186 
Immune 6443023 228 6508198 333 6620998 358 6724187 323 6853529 302 6895030 325 


4. CONCLUSION 

Localized research just as the present one provides tailored and accurate insights that are more 
suitable to be materialized by policymakers for specific geographies. The use of the ABM approach promotes 
the level of detail offered to individuals in the population. Important factors such as protective factors could 
provide insights on the proportion of the population that would be shielded by imposing control measures. A 
total of 31738240 agents that represent 90.67% of Telangana’s population were generated to be used for 
simulation. The simulation coded in python was run to compare the six different NPI scenarios for 365 days. 
Time series corresponding to each health state were obtained for each district to get localized measures that 
could help policymaking. The study also measures the effect of the use of control measures and the role of 
immunity in the spread of infection. Understanding the variation in the spread of infection with respect to the 
interventions provide better insights to the policymakers on how to strategize the policies to curtail the spread 
in different areas. Defining interactions of agents based on GIS coordinates and considering contacts at 
workspace and closer circle allow us to show variations in the spread of disease during different lockdown 
setups. This has more practical implications to deliver healthcare services with capacity requirements to more 
vulnerable people. The ethical good practices in modeling and ISPOR-SMDM modeling good research 
practices have been adopted throughout the study. As evident from the results, the interventions help to 
curtail the transmission of the infection which provides more time window for the policymakers to devise apt 
strategies locally and to research on developing vaccination programs. Lockdown was found to be the most 
efficient intervention to curtail the spread as its lifting drastically increased the infections in a much shorter 
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time. The risk estimates further support these as the RR, AR, and PAR% values revealed higher protection 
during the lockdown period and in scenarios where a higher proportion of the population followed control 
measures. These values indicated the level of protection offered in each scenario and the proportion of the 
population that could be shielded by the control measures (exposure). The effect of immunity provides 
information about possible secondary infections after the loss of immunity. These estimates could be of 
practical significance to plan the interventions based on the population to be shielded. Limitations to the 
study include the exclusion of comorbidities, transportation modes, and indirect transmission through 
suspended particles, which could be considered to improve the accuracy. 


DATA AVAILABILITY 
The python code, supplementary file, and detailed district-wise estimates files shall be shared by the 
authors upon request. 
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